FIGURE la 
(SEQ ID NO: 1 and SEQ ID NO: 2) 



-1101 

TTTAATCATG GAATATTTCA AACATACAGA AAAATCACAG AAAATAAATA ACAACCACTC ATTTATCTTC 

-1031 

TCCCCAACCC CATGTAATAA ATATTAAAAT ATTGTGTTAA ATGCTAAATT TAACACATGC TAAAGGTTCC 

-961 

TGGCTGGATG TGGTGGCTCA CGCCTGTAAT CCCAGTACTT TGGGAGGAGG AGGTGGGAGG ATTGCTTGAG 

-891 

TCCAGGAGCT CGAGACCAGC ATGGGCAACA TAGTGCGATC TCGTCTCTAC AAAAAACAAA AAAATTAGCT 

m 

-821 CO 

GGGCATGGTG GTGTGCATCA GTAATCCCAG TGACTGGGAG GCTGAGGTGG GAGAATTGCT TGAGTCTGGG — H 



-751 

AATTTGAGGC TGCAGTGAGC CCTGATCATG CCACTGCATT CCAGCATGGG CGACATAGCA AAACTTGTCA 

-681 

AAAAAAAAAA AAGTTTCCTC TCTGCCCCAC CATAGACAAC CACTCTTCTG ATTTCTATCT TCGTAGATGA 

AP1 -611 
ATTTTGCCCA TTCTCTTGTA TATGAAAGGA ACCA GACATT AGGCATTCTG GTGTCTGGTT TCTTTCACTT 

-541 

AAGATAAAAT TGAGTTAACC TGTATTGTTG TACAGAACTG CAGTTTGTTC TTTGTTATTT ATTGTAAAGA 

-471 

CAGGGTCTGG CTATGTTGCC TAGGCTGGTC TCGAACTGTT GGCCTCAAGC AATCCACCTG CCAAGCTCTG 

-401 

GGACCACAGG CATGAGCCAT GGCATCTGAT CKGTAGTTTG ATCTTATTTC TTGCTGAGTA GTAGCCCATG 

API -331 
GCA TGACTTT A TTATTTTGG GTGTCCATTC TCCTCTGGAG GGGCTCTGCT TTTTGAAACC ACACCCTGGC 

Ets -261 
CTAGCTCC CC TTCTCCCT GC CTCTCTGCAG GCTCACATCC ACATGCCAAG ACCTCTGCAG CCATTCTGCT 

Ets _ 191 
TCCTG TCCTT CCA CTCCTGT GGGACCTCAG AGAGCTACGG GGCTCCCTGG GTACCAACTG GCTCCT GAGG 

Spl/Sp3 Spl/Sp3 -121 

CCTGGGGGAG GGTGG TCTTC TGGGAGAAGG AAGCCAGGTC CCTGCA GGTT GTGGAGGGGG ACAGAATGAG 

Spl/Sp3 Ets _ 51 

GGTTTTTCCC CAGGAT GTTG TTGGCCCCTG CCCCCAC TTC TGTTCCATAA TTAACCACGC CCCTCC TACC 

Spl/Sp3 +1 20 

CACTGTGCCC CTCTT CCTGC TGTGTGGAGG CCC TGAATCA TTATTTTAAC TACCCCCTGG GAGGGTGAGC 

Et S Ets 90 

A CCTTCTG TG CTCTGTCCCC AACCTTCC AC TTCCCCTCAA CGCGCTGCTC AGGC jATG^ CC TTCGGCACTG 

M T P G T 



o 
o 



FIGURE la (continued) 



160 

TGCTTCTTCT GAGTGgtaag tggggccagg gtgctgggga gaagcttgga ggagttctga ggggactcca 

V L L L S 230 
tctgggaggg caggctgggg gctggtggtc ggctccaacc actcttatga ggagctgagg caggggagtg 

300 

cttcatgtgc gagtggcccg gagtcagtag agtgtgacct gaatgaagag gggctcaggg gctgtgctca 

370 

ggtggcgact aagctacctc tccagctggc tatgttgtcc caggcttccc tgctcccact catggagtcc 

440 

ctggtgtggg tgacagaggt ctccccagcc tcccccggga gtggaagcc acagaagcca ccagggaggg 

510 

ggaaaggttg gacatcacct ccctgggcct nnnnnttccc ccaagtcctg actgcacgta gggaagaggc 

INTRON 1 580 
cccctgctga aaactgcatc agagtcacat tcacgtgcca tcaaaaatca ggcttggctg ggtgcggtgg 

650 

ctcatgctta taatcccagc actttgggag gccgagatgg gcgtatcccc tgaggtcagg agtttgtgac 

720 

cagcctggcc aacatggtga aaccccatct ttaccaaaaa tataaaaatt agccgggcat ggtggcgtgc 

790 

acttgtaatc ccagctactt gggaagctga ggcaagagaa tcgcttgaac ccaggagacg gaagttgcag 

8 60 

tgagctgaga tcgtgccgtt gcactccagc ctcagcaaca gagcgagact ccatctcaaa aaaaaaaaaa 

930 

aaaaaaagaa aaaaaagaaa aagaggctgg gaggtcctag ggattggggc ttctttaact cccagcctcc 

1000 

ccgcccacca aatattcctc agTCCTGGCT TCTTATCATG GATTCAACCT GGATGTGGAG GAGCCTACGA 

V LA SYH GFNL DVE EPT 
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Sequence Range: -1 1390 to 10387 

(SEQ ID NO:3) FIGURE 9 

Protein Sequence: SEQ ID NO: 2 

Translational stop codon for CDllc 

-11321 

TGAT CCCTCT TTGCCTTGGA CTTCTTCTCC CGCGATTTTC CCCACTTACT TACCCTCACC TGTCAGGCTG 

-11251 

ACGGGGAGGA ACCACTGCAC CACCGAGAGA GGCTGGGATG GGCCTGCTTC CTGTCTTTGG GAGAAAACGT 

-11181 

CTTGCTTGGG AAGGGGCCTT TGTCTTGTCA AGGTTCCAAC TGGAAACCCT TAGGACAGGG TCCCTGCTGT 

-11111 

GTTCCCCAAA AGGACTTGAC TTCGAATTTC TACCTAGAAA TACATGGACA ATACCCCCAG GCCTCAGTCT 

-11041 

CCCTTCTCCC ATGAGGCACG AATGATCTTT CTTTCCTTTC CTTTTTTTTT TTTTTCTTTT CTTTTTTTTT 

-10971 

TTTTTTGAGA CGGAGTCTCG CTCTGTCACC CAGGCTGGAG TGCAATGGCG TGATCTCGGC TCGCTGCAAC 

-10901 

CTCCGCCTCC CGGGTTCAAG TAATTCTGCT GTCTCAGCCT CCTGCGTAGC TGGGACTACA GGCACACGCC 

-10831 

ACCTCGCCCG GCCCGATCTT TCTAAAATAC AGTTCTGAAT ATGCTGCTCA TCCCCACCTG TCTTCAACAG 

-10761 

CTCCCCATTA CCCTCAGGAC AATGTCTGAA CTCTCCAGCT TCGCGTGAGA AGTCCCCTTC CATCCCAGAG 

-10691 

GGTGGGCTTC AGGGCGCACA GCATGAGAGC CTCTGTGCCC CCATCACCCT CGTTTCCAGT GAATTAGTGT 

-10621 

CATGTCAGCA TCAGCTCAGG GCTTCATCGT GGGGCTCTCA GTTCCGATTC CCCAGGCTGA ATTGGGAGTG 

-10551 

AGATGCCTGC ATGCTGGGTT CTGCACAGCT GGCCTCCCGC GGTTGGGTCA ACATTGCTGG CCTGGAAGGG 

-10481 

AGGAGCGCCC TCTAGGGAGG GACATGGCCC CGGTGCGGCT GCAGCTCACC AGCCCCAGGG GCAGAAGAGA 

-10411 

CCCAACCACT TCCTATTTTT TGAGGCTATG AATATAGTAC CTGAAAAAAT GCCAAGCACT AGATTATTTT 

-10341 

TTTAAAAAGC GTACTTTAAA TGTTTGTGTT AATACACATT AAAACATGCA CAAAAAGATG CATCTACCGC 

-10271 

TCTTGGGAAA TATGTCAAAG GGTCTAAAAA TAAAAAAGCC TTCTGTGGAT ATGAGTCCTG AAGGATGACA 

-10201 

CCCATGGGGT CCCTTTACCA CGGTGGACCC TGGCCAGCAC TGAGGCCTGG GGCCAGGACA AGAAGTTAAC 

-10131 

CAGAGTAGGG TTGTGAATAT CCCTCTCTTG GAAGTAACCT GACCTCTTAA TCTGCTCACT CCACTCTCAG 



FIGURE 9 (continued) 



-10061 

GGCTGGTGCC GATGGTAAGC TGGTGGAGCT GTCGGGTGGA GGGGGCATAG AATAGAGAAG GGACAACCTC 

-9991 

CAGTGGCTAC TTTTCCACCT GGAAAGGTCT CTGGAGTGAC CAATACTCAC AAGCGTTTCC TACAAGTCCT 

-9921 

AGGATGTGTT GAAGGGCACA CTGTCTGCAT ATAGTGAGTG ATTGAAGAAC ATGTTGGGGT CCCACATTGA 

-9851 

GAGCTGCTGC CCACAATAAG GTCATTCTTG CTATTATGCC ACCATCCTGG CATAAAGTTC ATCATGGTGC 

-9781 

TTGGCACTGA GCTGGGGGCC TCACAGGACA AGCCATTCCT GACCTCGGAG TGACGCCACT GCAGCTATCA 

-9711 

CCAGCAAGGG ACCCGGGCCG TGTGGATGTT TCAATTAGAA AAACAGAAGG GAGGCAGTTG AGTGATTTGA 

-9641 

AGGGAAGATG GAAAGTGGCC CTTTACCTCC AGCCAAAAAT GTCTGTCCTA TACATCAGCA GAGGCTCCAA 

-9571 

AATCCCTGTG GATTTTGAAG CTTTTGAGTC CCCAGGATGA CTAATTATTA TGCAGTTTCC TCAGAAAGGG 

-9501 

AATCAGAAGA TAAGGCTTTG TAAGAATTCA GCCCTAATGG CTGGGCACAG TGGCTCATGC CTGTAATCCC 

-9431 

AGCACTTTGG GAGGCCGAGG CAGGAGGATT GTTTGTGCTC AGAAATTTGA GACCACCCTG GGTAATATAT 

-9361 

TGAAACCTTG TGTCTACAAA AAAATTTAAA AATTACCCAG GCATGGTGGC ATGTGCTTGT AGTCCCAGCT 

-9291 

ACTTGGTAGG CTGAAGCAGG AGGATCACTT GAGCCTGGGA GGTTGAGGAT ACAGTGAGCT GTGATTTGGA 

-9221 

CCACCACACT CCAGCCTGGG CAACAGAGAA AGATCATGTC TCAGAAAAAA AAAAAAAAAT TGACCCTAGA 

-9151 

GTGGTGTTTC TCAAAATGTG TTCCACGAAC CACTGGTGGT CAATGATGGT CTTCTAAGTG GAAGGTTTTA 

-9081 

GAGAAAAAGA GCAAGAAACC CATACATCTC AAACATTTGA AACTAGTGAT TTGCACAGAA ATAGTGTTGT 

-9011 

GGCCTTAATA ATTGTGTGGC ACACGGACTC CAGGGACTAC AGTGGGTTCT TGTCTAAATT CAGGCAACAA 

-8941 

GTTGTTATTT TCTATTTTAT TTTATTATTA TTATTTTTTG AGATAGTCTC ACTTTGTCTC CCGGGCTGGA 

-8871 

GTGTAGTGGC ACGATCTCGG CTCAACGCAA CCTCTGTCTC CTGGGTTCAA GTGATGCCTC TGCCTCAGCC 



-8801 

TCCCAAGTAG CTGGGAGTAC AGGGGCGTAC CACCATGCCC ATTTTTATTT ATTTATTTTT GAGACAGAGT 



CTCGCTCTGT 
CAAGTTCAAG 
GCTAATTTTT 
TCATGATCCG 
AGGTGTTATT 
TGGCTATTGT 
TTAAAAACTA 
GCCCAGTGCA 
AGAAGGTATT 
CATTCTGAAA 
TT ATT GAG AT 
TTTGCCTCCT 
CCACGCCCAG 
ACCCCTGACC 
CTTTCCTATT 
TCAAAGGTGC 
CAGTAGAGTT 
AAGAGCAGAC 
CATTGACAGA 



CACCCAGGCT 
CGATTCTCCT 
GTATTTTTAG 
CCCTCCTCGG 
TTCTGATATT 
GTTGCATTAG 
ATCTAATTGC 
CGTATAGTAT 
ATACTCCCTG 
CTTTATCAAA 
GGAGTCTTGC 
GGGTTCAAGT 
CTAATTTTTG 
TCATGATCTG 
CAAAAGTGAT 
TTTTTAAAGT 
CATACCTGCT 
AGAAGACCTT 
TTCTTTCTGA 



FIGURE 

GGAGTGCAGT 
GCCTCAGCCT 
TATAGATGGG 
CCTCCGAAAG 
CTTCCTTTGT 
ATCCCCGAAT 
TACAATTTAC 
AG AC AT GC AT 
ACAGGTAGGT 
CAGTCGCATG 
TCTGTCGCCC 
GATTCTCCTG 
TATTTTTAGT 
CCTGCCTCAG 
TAGTGTTGTC 
AGCAAAAGTA 
ATTGAAAGGT 
TACTCTGCAA 
TCTTCCAATT 



9 (continued) 

GGCATGATCT 
CTGGAGTAGC 
GTTTCACCAT 
TGCTGGGATT 
GTGTTATTGT 
CACAAAATGG 
AATAATGTCA 
ACCGGAATAA 
TGTATTGGTT 
AACCTCTGAA 
AGGCTGGAGT 
CCTCAGCCTC 
ATAGACGGGG 
GTAAAGCAAT 
ACCGAATACA 
GGTACAAGTC 
TGCTTTTGGC 
GAGGTTCAGG 
GATCGTCCTT 



TGGCTCACTG 
TGGGATTACA 
GTTGACTAGG 
AGAGGTATGA 
GTACATTTGT 
ATCAATGGCT 
TCAAAGTCAA 
GTGATTGTGA 
CTGACATGTA 
AGCTTTTGTG 
GCAGTGGCAT 
TTGAGTAGCT 
GTTTCACCAT 
AGAGATTCTT 
GAGGAGACAG 
ACATAATTTC 
TGCAAATAAT 
TGCAGGTTAG 
GCGGGGGCGG 



CAACCTCCGC 
GGCAGGCACC 
CTGGTCTCGA 
GCCACTGTAC 
TACATTTGCA 
CAAAAGCATG 
TATTGACTTT 
GCCAAAACCC 
TTTGTCCCTA 
TTATTTTCTT 
CATCTTGGCT 
GGGATTACAG 
GTTGGTCAGG 
AGAACAACTG 
CAAAACCACA 
CAAGAAGCTT 
AGAAAAAAAC 
TGTTTAATGC 
TTTAGTTCTT 



-8731 
CTCCCAGGTT 

-8661 
ACCATTTCCA 

-8591 
ACTCCTGACC 

-8521 
TTGGCCGACA 

-8451 
TTTTCAGGGT 

-8381 
GAAGTTGTGA 

-8311 
TAAATATTGA 

-8241 
GAAAATATCT 

-8171 
GTGTGCTGCC 

-8101 
ATTTATTTAT 

-8031 
CACTGCAACC 

-7961 
GCGCGCACCA 

-7891 
CTGGTCTCGA 

-7821 
CTACATGTAG 

-7751 
GTGACATAAA 

-7681 
GTAGAAATGG 

-7611 
AAAAGCATGT 

-7541 
AGAGTCTCAG 

-7471 
TCCCACTGAC 



FIGURE 9 (continued) 

-7401 

TAGGATTGGG TCAAATTCCA TCCCCTTGGT TGCATGCAGT GCTGAGAAGG TGAGCATGTG CTTTTCACAG 

-7331 

GCTTAATAAA AAGAGGTAGC TCCAGCCAGG TGCAGTGACT CATGCCTATA ATCTCAGCAC TTTGGGAGGC 

-7261 

AGAGGTGGGT AGGTCACCTG AGGTCAGGAG ATTGAGAACC AGCCTGACCA ACATGGCAAA ACTCTGTCTC 

-7191 

TACCGAAAAT ACAAAAATTA GCTGGGCATG GTGGCAGGTG CCTGTAATCC CAGCTACTTG GGAGGCTGAG 

-7121 

GCAGGAGAAT CGCTTGAACC TGAGAGGTGG AAGTTACAGT GAGCTGAGGT CATGCCACTT GCACTCCAGC 

-7051 

CTGGGGGACA GAGTAGAACT CTGTCTCAAC AAAAAAAAAA AAAAAAAGAG AAAAAAAAAG GAGGGTAGCT 

-6981 

CCACCAGCCA GGAAGGTGGC AGCGCTGGTG GCTGTTGGAT AGGCTACCTA CAGTGTCTGG CAAATACTAT 

-6911 

GCTTGAAGAC TATGCTGTGA GCAAGATTCC TTTGTGAAGG AACAGCTTGG ACATTGTGTA TGTCAGAGGT 

-6841 

ATACAGCAGA ATAGCAGTGA CTAACGCTTG TGTGGGAGAG CAAGCATGTC ACCTCATACT TGGAATAACT 

-6771 

CACTGCCATA CAAAGTCTGA ATCAGCTTTC GTCTTTGTGC AACACATGTA TGTGGGAGCT TTTCAGCTGC 

-6701 

TGAAACCTCT AGTGACAGAA AAGGAGGTTT TGTTGTTCAT TTGTAATTAA TGTTAATCCT ATGAGTGGTG 

-6631 

G GAG AG AT AG TGAGGTAGGA GATCAGCAGG ACCTGTTTTC TGGTCACAAC CCAGCTAATC AGAGCATGAT 

-6561 

CTGGTCAAGA TGGGATGCAC TAAAAAAACA GCCCAAACCA GCAGATGGCC AGGAAAGCAA ACTCTCATTA 

-6491 

CCCTCGCCAC TTATTAGCAT AAAGACACTC CCACCGGTGC CATGACAGTT TACAAATGCC ATGGAAACAC 

-6421 

ACCATAGCAA CGGTCAGCAA GTTACCTCAT ATGGTTCTGG AAACTCCCCA CACCTTTTCC AGATAGTTCT 

-6351 

GAATAACCCA CCCCTTAATT TGCATGTAAT TAAAAGTCGG TATAAGTACA GTTAGCCAGC AGCCCACTGG 

-6281 

CTGCTACTGT GGGCTCACTG CCTATGGGTT GTCCTGCTCT GCAAGGAACA GCTACCTTGC TGCCACTGCT 

-6211 

GCTTCAATAA ACCTGCTTTC TTCCACCACA GGCTCGCTCT TCAGTTCTTT CCTGAGCAAA GTTAAGAACC 



-6141 

CTCCCGGGCT AAGCCCCAAT TTTGGAGCTT GCCTGCCCTG CATCAGTAGA ATGGGCTAAC TACTTACGGT 



FIGURE 9 (continued) 



-6071 

GCACTCAGGC TAAAGAGGCT GATGCTTGCA GGGCAGTATT CACAGAGCAC ACGGTAGTTC ACGGGATGCC 

-6001 

TCTCACCCTT GACTCAGTGC TTAAGAAAGG AGGGAAAATG GTGAACATGA TCAAATCATG GCCATTGCCT 

-5931 

ATTCATCTTT TCAGTGTTGT ATGGAGGAAT AGGCAAGTAG GAGATTGCTT TTCACATTAA TGTCAAAGAG 

-5861 

AAAGATAGTT ACTTGGAACT TAAAAAAATT AATTGTGATA AAATATACAT AACATAAAAT TTACCATCTT 

-5791 

AACCATTTTT AAGTATAGCC AATCTCAAGA GCTCTTTCTA TCTTGTAAAA CTGAAACCCT ATACCCATTA 

-5721 

AACAACTCCC AATTCTCCCC TTTCCCTAAC TCCTGGCAAC CACAATTCTT TCTGTCTCTA TGAATTTGAC 

-5651 

TGCTTTGGCA TGTCATAGAA ATAGACTCAT ACAGCATTTG TCTTTTTGCG ACTGGCATAT TTTGCTTAGC 

-5581 

ATAATGTCCT CAAGGTTCAC CCATGTGGTA GCATGTGTCA GAATTCCTCT CCTTTTGAAG GCTGAATAAT 

-5511 

ATTCCATTGT GTGTATATAC CACGTTTTGT TTATCCATTT GCCCATCAAT GGGCATTTGG GTTGCTTTTT 

-5441 

TTGCCTCTCA TGAATGATGA ATATGGGCGC ACAAATATCT CTTCAAGACC ATGCTTTCAA TTCTCTTGGG 

-5371 

TATACACCCA GAAGTGGAAT TGCTGAATCA TATGGTAATT TTTTTTTTTT TTTGAGACAG AATCTTGCTC 

-5301 

TGTTGCCCAG GCTGGAGTGC AGTGGCACAA TCAGAGCTCA CTGCAGCCTT GGTCTTCTGG GCTCAAGCGA 

-5231 

TCCTCTTGCT TCAGCCTTCC GAGCTTCTGG GACTAAAGGT GTGTGCCATC ATGCCTGGCT AATGTTTTAA 

-5161 

AAACGTTGCC AGGCATGGTC GCTCGTGCTT GTAATCCTAG CACTTTGGGA AGCTGAGGCA GGTGGATCCC 

-5091 

CTGAGGTCAG GAGTTTGAGA CCAGCCTTGC CAACATGGTG AAATCCCGCC TGTACTAAAA ATACAAAAAT 

-5021 

TAGCTGGGTG TGGTGGCATG TGCCTGTAGT TCCAGCTACA GGCAGGAGAA TTGCTGGAAC CTGGGTGGCA 

-4951 

GAGGCTGCAG TGAGCCGAGA TTGCACCACT GCACTCCAGC CTGAGTGACA GAGTGAGACT CTGTCTCAAA 

-4881 

AAAAAAAAAA ATTTTAGAGA TGGTGTCTCA CTGTGTTGCC CAGGCTGGTC TTGAACTCCT GCCCTAAAGT 



-4811 

GATCCTCCTG CTTCCGCCTC CCAAAGTGCT GGGATTACAG GCATTAGCCA CCATGCCTGG CCTAGCTAAA 
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-4741 

TTGTCTTTAA TGTCGCATGT CTGCAAAAAA CACATCTATA AAGCTAGAAA AGTTGAGCAT CCAACTTTTT 

-4671 

ATGATTTAAC TCTCATGACC TGGCAATTTT TCTAGCAAGG AGCCTGGGCT GGTGGTTTTA GGAGAACTGA 

-4601 

GTGAAAAAAA GAAATACATT AACTAGATTG GATGCAAAGT GCCTGCTGGT CATGGGTGTT TTCTGCTGGC 

-4531 

CCCTGTTCAT CTGTGCCTGT TAGCCCACCC ATGGGTGAGT GGGGCAAAGT GGCCAAACTG ATTCTTAAGA 

-4461 

GAGGCATACA TGCAGAATCC AAGTTAGTCA TGATTTCGTT TCTAGTCTGA GTGAATGTGT GTCCAGAATA 

-4391 

TTTTATAAAC TTTATCAGCT CAGAGGGGAA AACCTGTCTC CATACTACGT GGTTTATACA AAGCTGTCAG 

-4321 

GAATTCAGCA TGATGAAGAA ATGCACAAAA CAAGTGTGAA CAGATAAGTA AAAGGATCTA CTGAAAATCT 

-4251 

TCAGGGTAGT ATATTGTGTG ACAGGACCAA GAATTTGAAG TCAACATCTG TATTTGTGCC CTCTGGACAA 

-4181 

AGGTATTATC CCTGATGATA TAAAAATTAA TTTTGGGCTG GGTGTGGTGG CTCATGCCTG TAATCCCAGC 

-4111 

ACTTTGGGAG GCTGAGGAGG GTGAATCGAC TGACGTCAGG AGTTGGAGGC CAGCCTGTAT CGACTAATAA 

-4041 

TACAAAAAAA TTAGCTGGAC ATGGTGGCGT GCACCTGTAA TCCCAGCTAC TCAGGAGGCT GAGGTGGGAG 

-3971 

AATTGCTTGA ACTCGGGAGG CCCAGGTTGC AGTGAGTCGC ACCACTGCAC TCTAGCCTGG GCGACAGAGT 

-3901 

GAGACWCCGT CTCAAAATAA ACAAAATTAA TTTCGAGGCC AGGTGCAGTG GCTCCAGGTG CGGTGGTTCA 

-3831 

TACCTGTAAT TCCAGTGCTT TAGGAGGCCA GAGGATTGCT TGAACCCAAC AGTTCGAGAT CAACCTGGGC 

-383 > 

-3761 

AACATCAGTG AGACTCCATC TGTAGAAAAC AATCAAACAG ACAAACAACA ACAACAAAAA AACCAGAGGT 

-3691 

GGGAGGATCA CTTGAGCCCA GGAGTCCGAG GCTGCAGTGA GCTATGGTCA CGCCACTGCC CTCTAGCTTG 

-3621 

GGCAACAGTG CCAGACTCTG TCCTTAACAA CAACAACAAC AAAAATTAAT TCTACTTTAA CTGTCAGTTT 

-3551 

CATGATATCC TTCTATTAAG AAAAACCTTT TCTATCTGAT GAACTATTGG CTAGGTTTTC TTTCTCTCTG 

-3481 

CTTTTGACTA ATGCATTTAA TTACTTTCAT TTGCAAACTC TATCCTTCTC ATCAACTTTG TATTTTAGAT 



FIGURE 9 (continued) 

-3411 

GTGTCTATTG ACAGCCTGGC TTCCCTCAGC GATCATTATG ATGATCAAAG TAGATGAATA GGTAAAATTC 

-3341 

AATGCAAATA TTCCAGGGCA TCTAAATCCA TACCCCAAAT GGGAAAAGGG GAGAATTGGA AGCCAGCAAT 

-3271 

TTGAACACAT TACTATGGAT GTATTTTTCT CATGCGGGGG AAAAAGTGAT TTGGAGAGAG AGAATTATGA 

-3201 

ATGCATGTGA AGAATAAAGC CAAATTTCCT GGGAGGAGGG GAAGACCAGG AGAAACAAAA CCAAATCCTG 

-3131 

GCTGTGGCCT CTAAGGCATG GGGACCTGGA GTTATGCTCT CCAGGCAGAC ACAGCTCATT CTGGAGAAAG 

-3061 

GCTGCAAAAA TATTCTCCTT CACATTGATT TGAAAACAAT TATTAAATTC TTGTTTTCTT ATTTATCTAA 

-2991 

GTGTAACTTT TTAAAACTTA CTGAGAGAAG ACGGGCACGG TGGCTCACTG CTGTAATCCA GTACTTTGGG 

-2921 

AAGTCAAGGC AGGTGGATCA CCTATGGTCA GGAGTTCGAG ACCAGCCTGG CCAATATGGC AAAACCCCGT 

-2851 

CTCTACTAAA AATACAAAAA TTATCAGGTG TGGTGGTGTG TGCCTGTAAT CCCAGCTACT CGGGAGGCTC 

-2781 

AGACAGGAGA ATCACTTGAA CCTGGGAGGC AGAGGTTGCA ATGAGCTGAG ATTGCACCAC TGCACTCCAG 

-2711 

CCTGGGCGAC AGAGCAGGAC TCCATCTCAA AATAAAAATT AATTGATTAA TTAATTAAAA ATTTACTGAG 

-2641 

AGCTGGTGGT TCCTTTAAGG GTGGAGCCGC CATCAAGTCC CCAGAGGATG CCCTGAATTT GGGGGCATCA 

-2571 

CCTTCAGCTG CTGTGGACTC TGAGCCTTGG CAGCTCCAGC TCCAGGCCTG GGAGAAAGAT GATTTCCTGG 

-2501 

CAGCGTGCAG TGATTGTGAG CATTTGACTA CCTTACTGCA TTTTGCCCTT ATCAKTGCTC TCCAAACATG 

-2431 

AGTGGAAAAC AAAAAATTTT GCTGAGACAA GCGATAATAC GAGTTAGGGA AAGTTGGAGA ATTTTATAGT 

-2361 

TGCTGATATC AGCAAATCGT GAGTTTCAAG CACTAACTTA CAGAAGGAAG TCCAAAATTA AAGGGGATAT 

-2291 

AGAAATGTGT AAAAGATGAG GTGTGGTGAA GATGGAGAAA ATGAAGAGCT CTTTAAATTT CTGAATTATG 

-2221 

AAGAATCACC AACAAATTAT TTTGTGGTTC CAAATACAGG GAGAAGTTCA CAGATCCACA GAACTGATGA 



-2151 

CAGGGTGCGG CCAGCCACAA ACCTTTCAGC ACAAGAGGGA GAAGGCTGCC GCTCCACTTT GCCTGGGCAG 
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-2081 

TCTTTGTAAG GCAGTAGATA AGTCAGCCTC GAAGTTAGCA ATCACAGCCC TCGGCTCGGT TTCCTGCAAG 

o^.o N486 

" 2052 \ ^ -2011 

GGCATCGTTA ATGCATCACA ATTAATTTCT TCTGTCCATT AAATGTCAGC TCTCAAGTAA ATTGATGTAA 

-1941 

AATTTTTGTA TAGAAAACTA TTTCATATTA TTTGCACTTG ATGTTTAATT ACATTTTAAA TGTTTTGTTT 

-1871 

GTTTCATTTT GTTTTGTTTT TGAGACAGAG TCTTGCTCTG TTGCCCACGC TGGAATGCAG TGGTGTGATC 

-1801 

TTGACTCACT GCAACCTCTG CCTCCTGGGT TTAAGCGATT CTCCTGCCTC AGCTTCCTGA GTAGCTGGGA 

-1731 

TTACAGGCGT GCACCACCAT GCCTGGCTAA TCTTTGTATT TTTAGTAGAG ATGGGGTTTC ACCATGTTGG 

-1661 

CCAGGCTGGT CCCGAACTCC TGACCTCAAG CTATACACYT GCCTCAGCCT CCCAAAGTGC TGGAATTACA 

-1591 

GACATAAGCC ACTGTGCCCA GCCAAATGTT TTAAATAATT GTCACATATA TATACAAAAT AATTTATGTT 

-1521 

ATAGGTAGGG ATCTTGTTAT ATTTTAACCT TCAAAGTATA TTCCTAAGCT TTTTATTTAT TTTTTATTTT 

-1451 

TTATTTATTG AGACAGTCTT GCTCTGTCGC CCAGGCTGGA GTGCAGTGGC GCAATCTCGA CTCACTGCAA 

-1381 

ACTCTACCTC CTGGGTTCAA GCGATTCTCC TGCCTCAGCC TCCTGAGTAG CTGGGATTAC AGGTGCGCAC 

-1311 

CACCATGCCC AGCTAATTTT TGTATTTTTA GTAGAGACGG GGTTTCACCA TATTGGCCAG AGCTGGTCTC 

-1241 

AAACTCCTGA CCTCAGGTGA TCCATCCACC TCAGCCTCTC AAAGTGCTGG GATTATAGGT GTGAGCCACT 

-1171 

GCGCCTGGCC TATTCCTAGC CTTTTATATA TAGACCTTTT TCTTTTTCAC ATTTTAAAGG AACTTTTATG 

-1101 

TTTAATCATG GAATATTTCA AACATACAGA AAAATCACAG AAAATAAATA ACAACCACTC ATTTATCTTC 

-1031 

TCCCCAACCC CATGTAATAA ATATTAAAAT ATTGTGTTAA ATGCTAAATT TAACACATGC TAAAGGTTCC 

-961 

TGGCTGGATG TGGTGGCTCA CGCCTGTAAT CCCAGTACTT TGGGAGGAGG AGGTGGGAGG ATTGCTTGAG 
N485 

| >- -891 

TCCAGGAGCT CGAGACCAGC ATGGGCAACA TAGTGCGATC TCGTCTCTAC AAAAAACAAA AAAATTAGCT 



-821 

GGGCATGGTG GTGTGCATCA GTAATCCCAG TGACTGGGAG GCTGAGGTGG GAGAATTGCT TGAGTCTGGG 
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-751 

AATTTGAGGC TGCAGTGAGC CCTGATCATG CCACTGCATT CCAGCATGGG CGACATAGCA AAACTTGTCA 

-681 

AAAAAAAAAA AAGTTTCCTC TCTGCCCCAC CATAGACAAC CACTCTTCTG ATTTCTATCT TCGTAGAT68 

API -611 
ATTTTGCCCA TTCTCTTGTA TATGAAAGAA ACCA GACATT AGGCATTCTG GTGTCTGGTT TCTTTCACTT 

-541 

AAGATAAAAT TGAGTTAACC TGTATTGTTG TACAGAACTG CAGTTTGTTC TTTGTTATTT ATTGTAAAGA 

-471 

CAGGGTCTGG CTATGTTGCC TAGGCTGGTC TCGAACTGTT GGCCTCAAGC AATCCACCTG CCAAGCTCTG 

-401 

GGACCACAGG CATGAGCCAT GGCATCTGAT CKGTAGTTTG ATCTTATTTC TTGCTGAGTA GTAGCCCATG 

API -331 
GCA TGACTTT A TTATTTTGG GTGTCCATTC TCCTCTGGAG GGGCTCTGCT TTTTGAAACC ACACCCTGGC 

Ets -261 
CTAGCTCC CC TTCTCCCT GC CTCTCTGCAG GCTCACATCC ACATGCCAAG ACCTCTGCAG CCATTCTGCT 

EtS -191 
TCCTG TCCTT CCAC TCCTGT GGGACCTCAG AGAGCTACGG GGCTCCCTGG GTACCAACTG GCTCCT GAGG 

Spl/Sp3 Spl/Sp3 _ 12 i 

CCTGGGGGAG GGTGGTCTTC TGGGAGAAGG AAGCCAGGTC CCTGCAGGTT GTGGAGGGGG ACAGAATGAG 



Spl/Sp3 EtS -51 

GGTTTTTCCC CAGGAT GTTG TTGGCCCCTG CCCCCA CTTC TGTTCCATAA TTAACCACGC CCCTCCT ACC 

Spl/Sp3 V 20 

CACTGTGCCC CTCTTC CTGC TGTGTGGAGG CCCT GAATCA TTATTTTAAC TACCCCCTGG GAGGGTGAGC 

Ets Ets 90 

A CCTTCT GTG CTCTGTCCCC AACCTTCC AC TTCCCCTCAA CGCGCTGCTC AGGG^TG^CC [tTc|gGC^C'iJg 

M T F G T 

160 

TGjcTTjcTljCT GjAGljGGTAAG TGGGGCCAGG GTGCTGGGGA GAAGCTTGGA GGAGTTCTGA GGGGACTCCA 
V L L L S V 

230 

TCTGGGAGGG CAGGCTGGGG GCTGGTGGTC GGCTCCAACC ACTCTTATGA GGAGCTGAGG CAGGGGAGTG 



CTTCATGTGC 
GGTGGCGACT 
CTGGTGTGGG 
GGAAAGGTTG 



GAGTGGCCCG 
AAGCTACCTC 
TGACAGAGGT 
GACATCACCT 



GAGTCAGTAG 
TCCAGCTGGC 
CTCCCCAGCC 
CCCTGGGCCT 



AGTGTGACCT 
TATGTTGTCC 
TCCCCCGGGA 
NNNNNTTCCC 



GAATGAAGAG 
CAGGCTTCCC 
GTGGAAGGCC 
CCAAGTCCTG 



GGGCTCAGGG 
TGCTCCCACT 
ACAGAAGCCA 
ACTGCACGTA 



300 

GCTGTGCTCA 
370 

CATGGAGTCC 
440 

CCAGGGAGGG 
510 

GGGAAGAGGC 
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580 

CCCCTGCTGA AAACTGCATC AGAGTCACAT TCACGTGCCA TCAAAAATCA GGCTTGGCTG GGTGCGGTGG 

650 

CTCATGCTTA TAATCCCAGC ACTTTGGGAG GCCGAGATGG GCGTATCCCC TGAGGTCAGC AGTTTGTGAC 

720 

CAGCCTGGCC AACATGGTGA AACCCCATCT TTACCAAAAA TATAAAAATT AGCCGGGCAT GGTGGCGTGC 

7 90 

ACTTGTAATC CCAGCTACTT GGGAAGCTGA GGCAAGAGAA TCGCTTGAAC CCAGGAGACG GAAGTTGCAG 

8 60 

TGAGCTGAGA TCGTGCCGTT GCACTCCAGC CTCAGCAACA GAGCGAGACT CCATCTCAAA AAAAAAAAAA 

930 

AAAAAAAGAA AAAAAAGAAA AAGAGGCTGG GAGGTCCTAG GGATTGGGGC TTCTTTAACT CCCAGCCTCC 



CCGCCCACCA AATATTCCTC AGTcJcTcbc'lj TCT^AtIcatJg G^TcKacIcT GlGATjGTGjsAGl 

L A s'y'h'g 1 F ' N L D V E 

TcjTTcjcAdGA GjGAlfcCAbGC I GGCtTTllGGdc AGlAGdGTGlGT dCAdTTdGGT I GG^CTlcGGT 
I F Q'E D A G G F G Q S v'v.q'f'g ' g's r 

CACCCTCCTT CCCCAACCTC CACTACATCA AGTCCTGTGG ATGGGTACAC GTGGGTTACC 



GAGbcTkcdA 
E P T 

1070 
AGGCCCCACT 

1140 
CGAGGGAGGT 



1210 

GTCCTGGAGG AAGGCCAGCA GGGGTGAGAA GTCTTCCCTT GGCTCCTTGG AGGCCCTGAC ATCAGCACCT 



1280 

ATTATTCTCA ATCCCAGGAA AGGCCACAAA ACTCTAGACA AGACCCTACC TTACCTCGGG AGGGAAGCCT 



1350 

TGAACCTGCC TCCCAGGCAG GGCCCACTTC TTGGGGCCAG TATGGTCACA CAGGGCCCAC ACTCATTAAC 



1420 

TTTGGAGTTT AATGTTCTGC CCTTGACCTC TTGAAATTCC TGATTATTTT TATTTTTATT TTTACTCCAG 



1490 

CTCTGTTACC CAGGCTGGAG TGCAGTGGTG CAATCACAGC TTACTGCAGC CTCAAACTCT CGGGCACAAG 



1560 

TGATCCTCTC ACCTCAGCCT CCTGAATAGC TGGGACCACA GGTGCATGCC ATCATGCCTG TTTTTTGTTT 



1630 

TGTTTTGTTT TACTTTTTAC AGAGATGGAG TCTTGCTATG TTGTCCAGAC TGGCTGAACT CCTGGGCTCA 



1700 

AGCAATCCTC CTGCCTTGGC CTCCCAAAGT GCTGGGATTA CAGGTGTGAG CCACCCTGTC TTGCCAATTC 



1770 

TTAAAAATTT TATCTGTGCA TTTGTGTTTT GCAAGTAAAG AATGATGGCA GGGCTGGGCA CCATGGCTCA 



1840 

CGCCTATAAT CCCAACSCTT TGGGAGGCTG AGGCGGGCAG ATCATCTGAG GCCAGGAGTT TGAGACCAGT 



FIGURE 9 (continued) 

1910 

TTGGCCAACA CAGCAAAACC CCATCTCTAC TAAAAATGCA AAAAAAATTA GCCGGGCATG GTGGCAGGCA 



1980 

TCTGTAATCC CAGCTACTTG GGAGGCTGAG GCAGGAGAAT CGCTTGAACC TGGGAGGTGG AGGTTGCAGT 

2050 

GAGCCGAGAT CGTGCCACTT TACTCCAGCC TAGGTGACAG AGTGAGACTC CGTCAAAAAA AAAAAGTCAT 

2120 

GGGAGAAGGG AGATGTACTG GGGGTTTGGA GCCTTAGCTC AGCAGCAGCC CCACCTCCCA CCGCCTCCTG 

2190 

AAGGGTGGTG AAGGGGTATC AGCTGCTGGC TCCCCCACCC ATGTGGGAGC AATGACCGCT GCTACCTTCC 

2260 

GCCCCTGGCA TGAGCTGGGT AAAGTCAGTT AGGGGCGCTC ACTCTGGGAG TACCCCGAGG GAGTGGGACA 

2330 

CTACATAGCA AATAAAAAAC GTCAGGACAG GTTGAGGAAA GAGAGCAGAA GAAAGGTAAG AGCCCCCCAA 

2400 

CCCCAAGAGA CCCCACAGTT TTATTTCAAA TTGGGACCCA CAAATTATGA ACCTGCCCCC ACTTCCAGGA 

2470 

GCTCACATTC TCCTGTCCCA GAGAGTTCAA GTCACAATGT GACACAGGTG TCACCAAGGT CTGGGGGGCG 

2540 

CAGGCAGGGA GAGAGCAGAC CCAGGAGGGT TCCATGGAGG AAGTGGTGCT GGCAGTGAGC CCCAGTGGAC 

2610 

AGGAAGGCTC AGTTGGTCAC GAGGAGCTAT AAGAGGTCAC CGAGCTCCAA CCGCGCACCC CTCTCCCTTC 

2680 

CTCATGTGAC TGGCAGTCTG GGGGGATGGA AGCAAGCACC AGGCACCAGG CTTTTGTTTT TCTTTATTTG 

2750 

GAAATGTGGT CAACTGAGGT GCACAAATCT GAAAGACCCA ATCTGATAAA GGATACACAT GTGCGTGCCT 

2820 

GGGTGAGCCC CACCTAGGTC AGCTGCTCCA GTGTCAAATC CCACAGGCAC AGGGCTGCCG TGGACCCCTT 

2890 

CTCATCACCC AACATCCCCA GAGAACCCCT GGTCAGACTT CTGTCACCAT CAGTTTTTTG GGCCACATTT 

2960 

TAAAAAAAGA ATACATTGGC TGAGTGCAGT GGCTTATGCC TATAATCCTA GAACTTTGGG AGGCTGAGGC 

3030 

GGGTGGATCA CCTAAGGTCA AGAGTTCAAG ACCAGCCTGA CCAATATGGT GAAACCCTGT CTCTACTAAA 

3100 

AAATACAAAA ATTAGCCTGG CGTGATGGCA GGTGCCTGTA ATCCCAGCTA GCTGGGTGAC TGAAATAGGA 



3170 

GATTTGCTTG AACCTGGGAG GTGGAGGTTG CAGTGAGCTG AGATCACGCC ATTGCACTCC AACCTGGGTG 
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3240 

ACAGAGTGAA ACTCTGTCTC AAAAAACATA TGGGTTGATG GGTTACACTA AAGTTTTGCT CATCGTTTGT 

3310 

ATCAGCAGGT TCCAAACTGC TACCTCTCTA GCCAATGCTC AGATTTTCTT CACAAAGCCT TAGGCATCCC 

3380 

CTGAATCATG ATGCACAGGG ATTGTAGCTT TCTGTAAAGG AGCGGCACCT AGAAGGAACC CTCACATGGC 

3450 

CATTTAATGA AGCCTTGCTT GGCGCATTAA AATACACCAG ATATCTGTCTG CTTTTCTCAC AGACAGGAGA 

3520 

TTGTGGGTAG TGAGAAAACA TTTCCAAAAT TAAAAAACTT TCCCACTCAG GGAGTTTTGC AAATAAACCC 

3590 

TTGACTCTAC ATAACTATAG ATATAGTTAT GGATCCTAGT ACACTGCTTT ACATTGGCCA ATTGAAATTG 

3660 

CTTATACAAT ATTTAAATTG GTCCAATGAA TTACAGAATC AACTATTTGT TTTGAAAGCA CATGTCTTCA 

3730 

GGAAATTGTT CCAATTAACT TGAGATGATC TTATTTCTTG GGTGGTTCAA AATAATGGCA ACTCAGAAAC 

3800 

GCAATGTGCT TACCCATGAT TGGGAAATGC CATTTTGGTC TTTAAATAGG TCTTTTTTTT TTTTTTTTTT 

3870 

TTTTTTTTTT GGTGAATGTT AAAAGAAAT TTCTAAACAT AAATACACAC ATACGTACTT ATGCACACTC 

3940 

AAAACCAAAT AAACCCCAGC ATGGCCCCTG GGCATCTGTG AGTTACACTT GGGCCCTGAT TTCTGAATAT 

4010 

TCTGCCAAGT GGCAAATGCC AGGAATTTCC CCCACAGAGT CTCGCTTCCC CATGGAGGGA CACTTCCTCA 

4080 

CCCCCAAGTG CCCGCTGCTC CCACCCCTCC TGTGGCTGCA GTGACATGGC CATGGTTGTG TCTCCAGAlCT 

1 L 
4150 

CpTC^GTG|GGA I GCA(CCC(:TG|3 AG|GTG|3TG|3C G|3CC^AC|:AG| ACG|GGA|CGG|c TG|TATjGAC[rG C|GCa|gCt|gCc| 

VVG APLE VVA ANQ TGR LYDC AAA 

4220 

ACCpGCjATGjr GC|CAC^CCC|AT C^CG|;tg£ac| AGTGAGTGAC CACCTGGGAA TTGGGCCCCT CAACCCTCCT 
TGMC QPI PLH I 

Intron 3 4290 
GGACCCAACT GTGCCCCCGC TTAGCTTCCA GTCCAGACCT TCCCCGCAAA TGAGTGTGTG CTGTGAGTGA 

4360 

GACCCCGCGT GTCTGCCCTT GCAGTC|CGC|C Cl|GAGj3CC|ST C^AAC^TGfTCC | TTGjGGC|CT(|\ CC|CTGjGCA|GC 

R P EAV NMS LGL TLAA 

4430 

C|rCCjACC^AC| GGC[rCC|CGG}: TcjcTC^GTGAG TGAGTGTCTT GGGCCACGGG GGGGTGGGGT GGGGCGGGGG 
STN GSRLL 

4500 

GTGTTGTTGG GGAGGAGGCT GGGGCTGGGA GTGAAGGAGG AGGGGCTGCT AGGGACTCCT GGCTCACAGG 

Intron 4 
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cttctgcctc cAGjGccfrG'i|G Gc|:cG^ccjCT g):ac(ag^3Tc| tgt£gg|gag^ Ac|rcA|rAcfrc 

AC GPTL HRV CGENSYS 

tgc|:tc|ct^c T^GGcfrccjcG c|tgg)gag^tc| atc)cag^c?)g tc[:cc|sac|gc c^cg|:ca|3Gt 

CLL LGSR WEI I Q T V PDA TPE 

CAGGCCATGG TTCCCTGTGG AGCACATGCT GGCACTGAGG GTGAGCAGGC GTGAGGCCTG 

4780 

CCTGTGCCCT CCCTGGAGGG CCGAGTGTGG CTAGGAGAGA AGCCAGGAGA AGAGGGTGGC TCAGGCAGGA 

4850 

GCCCTGCTGC TCCAGGGTAG AAGTTCTTTG CAGGGTTTTT CTTTATATTT TTTTCTTTTT AAGACAGGGT 

4920 

CCCTGCCAGG CACAGTGGCT CAGGCCTGTA ATTCCAGCAT TTTAGGAGGC TGAGGTGGGC GGGATCACCT 

4990 

GAGGTCAGGA GTTCGAGACC AGCCTGGCCA ATGTGGTGAA ACCCCTCTAC TAAAAATACA AAACAAAACA 

50 60 

AAACAAAATA GCAGGATGTG GTGGTGTGCG CCTGTAATCC CAGCCACTCG GGTAGGCAGA GACAGAAGAA 

5130 

TCGCTTGAAC CCAGGAGGCG GAGGTTGCAG TGAGCTGAGA TTGTGCCATT GCACTCCAGC CTGGGTGACA 

5200 

AGAGCAAAAC TCCATCTCAA AAAAAAAAAA AAAACAAAAA ACAGAGTTTC TGTCAGGCTG CATGCACCAC 

5270 

CACACCCTGC TAATTTTTTT GAGACAGAGT CTTGCTCTGT CGCCCAGGCT GGAGTGCAGT GGTGCAATCA 

5340 

TAGCTCACTG CAGCCTCGAA CTCCTGGGCT CAAGTGATCC TCCTCCCTTA GCCTACTGAG TAGTTGGGAC 

5410 

TGCAGGTACA TGCATCACAC CTGGCTAATT AAAAAAAATG TTTTTGTAGA AATGGGGGTC TTGCTATGTT 

5480 

ACCCAGCCTG GTCTTGAACT CCTGGGCTCA AGTAATCCTC TGCCACAGCC TCTCAAAGTG TTGGGATGAC 

5550 

AGGCATGAGT CCTTGTGCCT GGCCTGAGGG ATGAAAGTTC TGATGGAGGC AGAGAGGAGC CCCACTGTGC 

5620 

GGGCTGTAGA GGGCACAGCA TCTTCCAGTT GCCAACAGGT GCATGGCCAC TTCTTGAGTT TCAGAGGAAG 

5690 

GACCTTAGTG TGGTAAAGAA CGTGGTGAGG AAGATAAATC CATGAGGGAG GTGTTTCTTC TGGATGGTTC 

57 60 

ACTGCTGAGC TTCCAGGATT CCCCAAACTA ACTTTCCTCT CGAAGAGGAG CAAATGACAG GGCTGCGGAA 



4570 
A|AAG|GGTjrCCj 

K G S 

4640(1700) 
AGGTCCCTGG Intron 

4710 ( " 67 > 

TGTCTGGGQC 



5830 

AATGCGATGT GCAATTTTGT CAGTGCCCAT GTCTTCCACA GAGAACAGGG CCTGGGGACA CCACCATGAC 
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5900 

ATCTCTCTGA GGGTTGGTCT GCATCATGGT GGTTCCCAAG TTTGTTTTCC ATGGGCACCA GGCTTCATTC 

5970 

CCTTGAAGCT TCATTCCCTC AAAGCCATTC AGTTTCCTCA TTGGTAAAAT AGAGCTCAAT AATCAGGGGG 

6040 

TTATGAAGGT GAAAGGGATT GAGGTGCATA AAGCACTTGG AACCCTGCCT GGCACATAGT ATGTGATAGC 

6110 

CCCTCTGACC CATCTTCCAG CTGGGGACTG CATGCTGGGA CTGGGAGGAA GATACAGGCA AACTGTCTCA 

6180 

TCTGCCGTGT GAGAGGGAAT GCCAGGGGCC GCTCAGGGTG CTGACCGAGG GTGGGGCTTC AGACCAGAGA 

6250 

GGCCATGATG ACAGGCATGC TGGGCCTTTA GACAAAGGTG GAGCAGCAGC AGAAACATTA CCAGAGCAAA 

6320 

TGGTGAGGGT GGAGTCTATG GAGGGGACCA AGGGAAGGGG GAAGGGACAT CCAGGGTTCT TGGGGGGACC 

6390 

GTGCCCAGCC TGAGATGTCT GTGAAGCTAG GTTAGGGAGG TGGCACTTAA AAACAAGGGG TAAATGCTTT 

6460 

CTCACAGCCA TCCGTGGAAC TCATGAGGTG GGATGCCTGA TGCAAATGGG ACTGGAGCAC AAAACTGGTG 

6530 

CAGGCAAGGG GGGTGTGGGT CCAAGTAGAA GGGACCAGGG TCCACTGAGG ATCACCTGTG TGCCAAGCAG 

6600 

TGCTGAATAC CTGGTATGAA TCACCTTATT GCATCCTCAC AACATCCTGG GTGGTGGGCA GGCCCATTCT 

6670 

CATTTTACAG ATATGAAAAC CAAGGTTCAG ATAGATGAGT TCCATCGATA GCAAGAGGCA GAGCCCAGAG 

6740 

CTTGAGCCAT CCTTGCCTGA TTGGTGGGGT CCTTTTTCAA AAGGATAAGT CCAGGCTTCT GCTAGTGGGA 

6810 

GACCAGGGGA TACAATAAAA AGACCAAGAA ACAGAAGAGA CATTGTGAGA GGATTTGCCA CAGACCTGGC 

6880 

CTGAGAGAGG ATGAGAGGGT GGTTTCTTGA CGCAGCTGAA AAAACAGGCA CCACTGCAAG ATGTTGGCTG 

6950 

CCCAGATGTG GGCAAAAAAC GGGGAGCTCC TGGGGGGATC TGCAGCCTGC CCCATGGATG TCAAGATTTG 

7020 

CTGGTGATTG AAGAAGCAGG AAGGAAGTGA CCTTCTGTTT CTCCCCAGCA CCCTTGAAGC ACCAGTGGTT 

7090 

GAGCAAGTGG GGTAGGGGAG AGGAAAGAGG AAAAGGCATT TTTTTTTTCT GCAGTGGTGG GCAGGGGGCA 



7160 

GAAACCACAG CCCTGTGGTG TGGGCCTCAC ACCTTAGTGC TCTGGTGGCC TGATCTCCCA GTGCCCTGCG 



FIGURE 9 (continued) 

7230 

GGCAGCACAG GATGTGGCTG CTGGTGGAGG TACCAACTGG GCCCTGAACA CAGGCCACAC ACCCCCCATG 

7300 

AGCCTGGGGA CAGCATGAAA AGTCTTATTT GTTCATGTGG ATATGATGTG CCCTCACGAT TGCAGAGTGA 

7370 

ACTCCACAAA CTCTGAGGTA ACTTGGGAAT GTTCTTTTTT TTTGAGACGG AGTCTCACTC TGTCGCCCAG 

7440 

GCTGGAGTGC AGTGGCACAA TCTTGGCTCA CTGCAGCCTC CACCTCCCAG GTTCAAGTGA TTCTCCTGCC 

7510 

TCAGCCCCCC AAATAGCTAG GATTACAGGC ACCGCCACCA TGCCGGGCTA ATTTTTTTGT ATTTTTAGTA 

7580 

GAGATGGGGT TTCACCATGT TGGCCAGGCT GGTTTTGAAC TCCTGACCTC AAGTAATCCG CCCACCTCAG 

7650 

CGTCCCAAAG TGCTAGGATT ACAGGCGTGA GCCACCACTC CCAACTGGGA ATATTCTTGG GCACCGCACC 

7720 

CATGGGAGCA TGAAGGGTGG ATGCAATGCA ATCATAACAG AGGCCCAAGG TCAGCACTGG GGTGCTTGCC 

7790 

TGTCATCCCA GTGCTTTGGG AGGCCGAGGT GAGTGGATCG TTAGAGCCCA GGAGGTTGAG ACCAGCCTGG 

7860 

GCAACATGGC GAAACTCCCT CTCTACAAAA AGATACAAAA ATTAGCCAGG CAAGGTGGTG CACACCTGTA 

7930 

GTCCCAGCTA CTCAGGAGAC TGAGGTGGGA GAATTGCCTG AGCCTGGGGA GGTCGAGGCT GCACTGACCT 

8000 

GTGATCACAC CACCACACTC CAGCCTGGGT GACAGTGAGA CTCTGCCTCA AAAAAACAAA AAATGAAAAA 

8070 

ACCAGAGGCC TCAGCCAATG CCTGGGGGGC TCAGAGTGCA GCTGGCCCTT CAGACGCTGA ACCAGTCATC 

8140 

GGTAAAGGTT TCCTCCAGGG GCAGGAGGTG TCCCAGTGGG CAACAGTTCC CCTCTGCCTA GCGTGATTCC 

8210 

TGGGAAGGGA CTCAGCTCAG AGCCAACTCC ACGTAGCTGG AAATAAGGAC CTCTGACCGA CTGGGGGTAG 

8280 

GGTGGGGTCT GGGGTGGATC CCTGCCCCAC CCCCACAGCA TCCCTACAGG CATATCCTAC AGGCCTCGAA 

8350 

GGTGCCTGGC ACGTGGTGAG AATGGTGCCA GCGGCTGACC CTGGCAGAGG GCCAGGACTT GTCTCCAGCA 

8420 

CCCATGTGCG TGTTGCTTTA TCCTTGCAGT GATCCCACGT GGTAGCCACT GATATTACCT TCATTTTACA 



8490 

GATAGGGACA CTGAAGTCCA GAGAAGTTAA GTAATGTGCC TGAATTCACC AATTAGCACG TGGCTGAGCT 



TATATTATTT GTAGGGCTTC 



FIGURE 9 (continued) 

AAAACACATG GGAAATGGTT TGTAAATCCA 



8560 

AAATAATTCC AAAATAAAGT 



TTATTAAAAC TGAAAACAAT ATGGCTTGGT GTGGTGGCTC ACACCTGTAA 



9960 

TCCCAGCACT TTGGGAGGCT 



NGAGGTGGGA GTATTGCTTG AGGCCAAGAG TTCGAGACCA GCCTGGGCAA 



10030 

CATAGTGAGA CCTTGTCTCT 



ACCAAAAACA AAACAAAACA AAAAACAAAG CCAGGCATGT GACGTGTGCC 



10100 

TGTAGTTCCA GCTACTTGGA 



GGCTGAGGCA GGAGGATCAC TTGAGGCCAG GAGTTTGAGA GACCCTGTCT 



AACAATAGTA ACAGGCACTG AGCCCTGGGC CCTCCCCACT GGCCTTTGCA 



10170 

CTACAAAAAA TTAAAATAAA 
10240 

_GTTT|3CA|CTG| ATG|3AG|rAC|r 
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rdACCtlTCk CdCAAlTTCfcG GkCCkGCfcCG I AGCfcAGbAGk GCfcTGbTGbA 
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10380 

GCfcTGkCGlrT cKcGbcckcG I GGdATCbTGtv CAbTGjGTGTA AAGCAACCCC 
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